
    Metalibm: A Mathematical Functions Code Generator

    There are several different libraries with code for mathematical functions such as exp, log, sin, cos, etc. They provide only one implementation for each function. As accuracy and performance are linked, that approach is not optimal. Sometimes there is a need to rewrite a function's implementation with respect to a particular specification. In this paper we present a code generator for parametrized implementations of mathematical functions. We discuss the benefits of code generation for mathematical libraries and present how to implement mathematical functions. We also explain how mathematical functions are usually implemented and generalize this idea to the case of an arbitrary function with implementation parameters. Our code generator produces C code for parametrized functions within a known scheme: range reduction (domain splitting), polynomial approximation, and reconstruction. This approach can be extended to generate code for black-box functions, e.g. functions defined only by differential equations.
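
    As an illustration of that scheme, the following hand-written C sketch implements exp via range reduction, a polynomial approximation on the reduced interval, and reconstruction. The constants and the degree-4 Taylor polynomial are illustrative choices, not Metalibm output; a generator would instead fit a minimax polynomial whose degree matches the requested accuracy.

    #include <math.h>
    #include <stdio.h>

    /* exp(x) = 2^k * exp(r), with x = k*ln2 + r and |r| <= ln2/2.
     * The polynomial degree is the kind of parameter such a generator
     * exposes; plain Taylor coefficients are used here for clarity. */
    static double exp_sketch(double x)
    {
        const double LN2     = 0.6931471805599453;
        const double INV_LN2 = 1.4426950408889634;

        /* range reduction (domain splitting): x = k*ln2 + r */
        int k = (int)nearbyint(x * INV_LN2);
        double r = x - (double)k * LN2;

        /* polynomial approximation of exp(r) on [-ln2/2, ln2/2] */
        double p = 1.0 + r * (1.0 + r * (0.5 + r * (1.0/6.0 + r * (1.0/24.0))));

        /* reconstruction: scale by 2^k */
        return ldexp(p, k);
    }

    int main(void)
    {
        double x = 3.7;
        printf("sketch %.17g, libm %.17g\n", exp_sketch(x), exp(x));
        return 0;
    }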

    Satisfiability Modulo Transcendental Functions via Incremental Linearization

    In this paper we present an abstraction-refinement approach to Satisfiability Modulo the theory of transcendental functions, such as exponentiation and trigonometric functions. The transcendental functions are represented as uninterpreted in the abstract space, which is described in terms of the combined theory of linear arithmetic on the rationals with uninterpreted functions, and are incrementally axiomatized by means of upper- and lower-bounding piecewise-linear functions. Suitable numerical techniques are used to ensure that the abstractions of the transcendental functions are sound even in the presence of irrationals. Our experimental evaluation on benchmarks from verification and mathematics demonstrates the potential of our approach, showing that it compares favorably with delta-satisfiability/interval propagation and with methods based on theorem proving.
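
    To make the linearization concrete, the C sketch below (not the authors' tool) shows the two kinds of piecewise-linear lemmas for the convex function exp: a tangent line gives a sound lower bound, and a secant over an interval gives a sound upper bound on that interval. The abstraction-refinement loop and the SMT solver integration are omitted; the sample points are arbitrary.

    #include <math.h>
    #include <stdio.h>

    /* Tangent of exp at c: exp(c) + exp(c)*(x - c). Convexity makes any
     * tangent a lower bound on exp everywhere. */
    static double tangent_lower(double x, double c)
    {
        return exp(c) * (1.0 + (x - c));
    }

    /* Secant through (a, exp(a)) and (b, exp(b)): an upper bound on [a, b]. */
    static double secant_upper(double x, double a, double b)
    {
        double slope = (exp(b) - exp(a)) / (b - a);
        return exp(a) + slope * (x - a);
    }

    int main(void)
    {
        double a = 0.0, b = 2.0, x = 1.3;
        printf("%f <= exp(%f) = %f <= %f\n",
               tangent_lower(x, 1.0), x, exp(x), secant_upper(x, a, b));
        return 0;
    }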

    Automatic Estimation of Verified Floating-Point Round-Off Errors via Static Analysis

    This paper introduces a static analysis technique for computing formally verified round-off error bounds of floating-point functional expressions. The technique is based on a denotational semantics that computes a symbolic estimation of floating-point round-off errors along with a proof certificate that ensures its correctness. The symbolic estimation can be evaluated on concrete inputs using rigorous enclosure methods to produce formally verified numerical error bounds. The proposed technique is implemented in the prototype research tool PRECiSA (Program Round-off Error Certifier via Static Analysis) and used in the verification of floating-point programs of interest to NASA.
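
    As background on the rounding model such an analysis builds on (this is a plain numeric sketch, not PRECiSA's symbolic semantics or its proof certificates), the C code below propagates a first-order absolute round-off bound alongside each operation, following the standard model fl(a op b) = (a op b)(1 + d) with |d| <= u.

    #include <float.h>
    #include <math.h>
    #include <stdio.h>

    /* A value together with an accumulated absolute round-off bound. */
    typedef struct { double val; double err; } approx_t;

    static const double U = DBL_EPSILON / 2.0;  /* unit roundoff, binary64 */

    static approx_t fp_add(approx_t a, approx_t b)
    {
        approx_t r;
        r.val = a.val + b.val;
        /* propagated error plus the fresh rounding error of this addition
         * (first-order: the bound is taken on the computed value) */
        r.err = a.err + b.err + fabs(r.val) * U;
        return r;
    }

    static approx_t fp_mul(approx_t a, approx_t b)
    {
        approx_t r;
        r.val = a.val * b.val;
        r.err = fabs(a.val) * b.err + fabs(b.val) * a.err + a.err * b.err
              + fabs(r.val) * U;
        return r;
    }

    int main(void)
    {
        /* inputs are taken as exact here; representation error is ignored */
        approx_t a = {0.1, 0.0}, b = {0.2, 0.0}, c = {3.0, 0.0};
        approx_t e = fp_mul(fp_add(a, b), c);   /* (a + b) * c */
        printf("value %.17g, round-off bound %.3g\n", e.val, e.err);
        return 0;
    }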

    Wave Equation Numerical Resolution: a Comprehensive Mechanized Proof of a C Program

    We formally prove correct a C program that implements a numerical scheme for the resolution of the one-dimensional acoustic wave equation. Such an implementation introduces errors at several levels: the numerical scheme introduces method errors, and floating-point computations lead to round-off errors. We annotate this C program to specify both method error and round-off error. We use Frama-C to generate theorems that guarantee the soundness of the code. We discharge these theorems using SMT solvers, Gappa, and Coq. This involves a large Coq development to prove the adequacy of the C program to the numerical scheme and to bound errors. To our knowledge, this is the first time such a numerical analysis program is fully machine-checked. (Research report RR-7826, 2011.)
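
    For orientation, here is a minimal sketch, not the verified program from the paper, of the kind of C code and annotations involved: the second-order finite-difference update for the 1-D wave equation with an illustrative ACSL-style contract of the sort Frama-C turns into proof obligations. The names and the contract are placeholders; the paper's actual annotations also specify method-error and round-off-error bounds.

    #define N 100

    /*@ requires \valid_read(p_prev + (0..N));
      @ requires \valid_read(p_curr + (0..N));
      @ requires \valid(p_next + (0..N));
      @ requires 0.0 < cfl <= 1.0;
      @ assigns p_next[1..N-1];
      @*/
    void wave_step(const double p_prev[N + 1], const double p_curr[N + 1],
                   double p_next[N + 1], double cfl)
    {
        double c2 = cfl * cfl;
        /*@ loop invariant 1 <= i <= N;
          @ loop assigns i, p_next[1..N-1];
          @*/
        for (int i = 1; i < N; i++) {
            /* three-point stencil; the round-off of this expression is what
             * the paper's Gappa/Coq development bounds rigorously */
            p_next[i] = 2.0 * p_curr[i] - p_prev[i]
                      + c2 * (p_curr[i + 1] - 2.0 * p_curr[i] + p_curr[i - 1]);
        }
    }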

    FPGA acceleration of the phylogenetic likelihood function for Bayesian MCMC inference methods

    Background: Maximum likelihood (ML)-based phylogenetic inference has become a popular method for estimating the evolutionary relationships among species based on genomic sequence data. This method is used in applications such as RAxML, GARLI, MrBayes, PAML, and PAUP. The Phylogenetic Likelihood Function (PLF) is an important kernel computation for this method. The PLF consists of a loop with no conditional behavior or dependencies between iterations; as such, it offers high potential for exploiting parallelism using micro-architectural techniques. In this paper, we describe a technique for mapping the PLF and supporting logic onto a Field Programmable Gate Array (FPGA)-based co-processor. By leveraging the FPGA's on-chip DSP modules and the high-bandwidth local memory attached to the FPGA, the resulting co-processor can accelerate ML-based methods and outperform state-of-the-art multi-core processors.

    Results: We use the MrBayes 3 tool as a framework for designing our co-processor. For large datasets, we estimate that our accelerated MrBayes, if run on a current-generation FPGA, achieves a 10× speedup relative to software running on a state-of-the-art server-class microprocessor. The FPGA-based implementation achieves its performance by deeply pipelining the likelihood computations, performing multiple floating-point operations in parallel, and using a natural-log approximation chosen specifically to leverage a deeply pipelined custom architecture.

    Conclusions: Heterogeneous computing, which combines general-purpose processors with special-purpose co-processors such as FPGAs and GPUs, is a promising approach for high-performance phylogeny inference, as shown by the growing body of literature in this field. FPGAs in particular are well suited for this task because of their low power consumption compared to many-core processors and Graphics Processing Units (GPUs).
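
    For readers unfamiliar with the kernel, the schematic C version below shows why the PLF parallelizes so well: iterations over alignment sites are independent. The names are hypothetical, and MrBayes' actual data layout, scaling, and rate categories are omitted; this is only the shape of the computation the FPGA design pipelines.

    /* Combine the conditional likelihoods of two child nodes through their
     * 4x4 transition-probability matrices to get the parent's likelihoods. */
    #define STATES 4

    void plf_node(int n_sites,
                  const double p_left[STATES][STATES],
                  const double p_right[STATES][STATES],
                  const double *cl_left,    /* n_sites * STATES entries */
                  const double *cl_right,
                  double *cl_parent)
    {
        for (int s = 0; s < n_sites; s++) {       /* no cross-iteration deps */
            for (int i = 0; i < STATES; i++) {
                double left = 0.0, right = 0.0;
                for (int j = 0; j < STATES; j++) {
                    left  += p_left[i][j]  * cl_left[s * STATES + j];
                    right += p_right[i][j] * cl_right[s * STATES + j];
                }
                cl_parent[s * STATES + i] = left * right;
            }
        }
    }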

    Fixed-point trigonometric functions on FPGAs


    GPU vs FPGA: A Comparative Analysis for Non-standard Precision

    FPGAs and GPUs are increasingly used in a range of high-performance computing applications. When implementing numerical algorithms on either platform, we can choose to represent operands with different levels of accuracy. A trade-off exists between the numerical accuracy of arithmetic operators and the resources needed to implement them. Where algorithmic requirements for numerical stability are captured in a design description, this trade-off can be exploited to optimize performance by using high-accuracy operators only where they are most required. Support for half and double-double floating-point representations allows additional flexibility to achieve this. The aim of this work is to study the language and hardware support, and the achievable peak performance, for non-standard precisions on a GPU and an FPGA. A compute-intensive program, matrix-matrix multiply, is selected as a benchmark and implemented for various matrix sizes. The results show that for large enough matrices, GPUs outperform FPGA-based implementations, but for some smaller matrix sizes, specialized FPGA floating-point operators for half and double-double precision can deliver higher throughput than an implementation on a GPU.
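
    As background on the double-double format mentioned in the abstract, the C sketch below shows how a value can be carried as an unevaluated sum of two doubles, roughly doubling the significand width. It is based on the classic Knuth two-sum error-free transformation and is not a description of either platform's operator designs.

    #include <stdio.h>

    /* double-double: hi + lo, with |lo| small relative to hi */
    typedef struct { double hi, lo; } dd_t;

    /* error-free transformation: a + b == s + e exactly */
    static void two_sum(double a, double b, double *s, double *e)
    {
        *s = a + b;
        double t = *s - a;
        *e = (a - (*s - t)) + (b - t);
    }

    static dd_t dd_add(dd_t x, dd_t y)
    {
        double s, e;
        two_sum(x.hi, y.hi, &s, &e);
        e += x.lo + y.lo;
        dd_t r;
        two_sum(s, e, &r.hi, &r.lo);   /* renormalize */
        return r;
    }

    int main(void)
    {
        dd_t a = {1.0, 1e-20}, b = {1e-16, 0.0};
        dd_t c = dd_add(a, b);
        printf("hi = %.17g, lo = %.3g\n", c.hi, c.lo);
        return 0;
    }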